Analysis of human leukaemias and lymphomas using extensive 
immunophenotypes from an antibody microarray 
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Summary 

A novel antibody microarray has been developed that provides an extensive 
immunophenotype of leukaemia cells. The assay is a solid phase cell-capture 
technique in which 82 antigens are studied simultaneously. This paper 
presents the analysis of 733 patients with a variety of leukaemias and 
lymphomas from peripheral blood and bone marrow. Discriminant Function 
Analysis of the expression profiles from these 733 patients and 63 normal 
subjects were clustered and showed high levels of consistency with diagnoses 
obtained using conventional clinical and laboratory criteria. The overall levels 
of consensus for classification using the microarray compared with 
established criteria were 93-9% {495/527 patients) for peripheral blood and 
97-6% (201/206 patients) for bone marrow aspirates, showing that the 
extensive phenotype alone was frequently able to classify the disease when the 
leukaemic clone was the dominant cell population present. 
Immunophenotypes for neoplastic cells were distinguishable from normal 
cells when the leukaemic cell count was at least 5 x 10 9 cells/1 in peripheral 
blood, or 20% of cells obtained from bone marrow aspirates. This technique 
may be a useful adjunct to flow cytometry and other methods when an 
extensive phenotype of the leukaemia cell is desired for clinical trials, research 
and prognostic factor analysis. 

Keywords: leukaemia, CD antigens, immunophenotype, expression profile, 
array. 
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Leukaemias are currently diagnosed by a combination of British (FAB) system divides acute myeloid leukaemia (AML) 
morphological analysis, cytochemistry, immunophenotyping, into eight subtypes (M(MvI7) and acute lymphoblastic 
cytogenetics and molecular genetics. The French-American- ieukaemia (ALL) into three subtypes (L1-L3), for both 
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B- and T-cell lineages (Bennett et al, 1976, 1991; Bain, 2003). 
The World Health Organisation (WHO) classification for 
AML builds upon ihe morphological criteria used in the FAB 
system Gaffe et al, 2001). Immunophenotypic analysis has a 
central role in distinguishing between minimally differentiated 
AML and ALL, for recognition of acute megakaryoblastic 
leukaemia, and in the separation of B- and T-cell ALL (Jaffe 
et al, 2001). The biphenotypic acute leukaemias, acute leukae- 
mias with lineage infidelity, minimally differentiated AML, and 
some Precursor ALL can only be distinguished through 
immunophenotyping (Brunning, 2003). The European Group 
for the Immunologic Classification of Leukaemia (EGIL) 
proposed a scoring system to identify biphenotypic leukaemias 
(Bene et al, 1995). Relationships between immunophenotype, 
classification and prognosis of leukaemias have been studied 
extensively (Ferrara et al, 1998; Khaiidi et al, 1998, 2999; Exner 
el al, 2000; Dunphy et al, 2001; Raspadori et al, 2001; Thai- 
ham mer-Scherrer et al, 2002; Munoz et al, 2003). Casasnovas 
et al (2003) developed a classification system for AML based 
on immunophenotypes from seven antigens, defining five 
subtypes, MA-ME, that correlated with prognosis. A more 
extensive immunophenotype should enable more accurate 
diagnostic and prognostic analysis. 

The B-lymphoproliferative disorders (B-LPD) encompass a 
broad range of leukaemias and lymphomas. Distinctions 
between some of the B-LPD may be subtle; for example, chronic 
lymphocytic leukaemia (CLL) and mantle cell leukaemia (MCL) 
differ in the level of CD23, Dl-cyclin and surface immunoglob- 
ulin expression, B-LPD have been classified by the FAB system 
(Bennett et al, 1989), and more recently by the WHO classifi- 
cation (Jaffe et al, 2001 ). The WHO classification divides B-LPD 
into subtypes that correspond to various stages of B-cell 
differentiation, ranging from naive B cells to plasma cells. The 
B-LPD and their relationship to normal B-cell differentiation 
have been reviewed by Shaffer et al (2002); phenotype is a crucial 
component for diagnosis and classification of B-LPD. Some B- 
LPD do not correspond with classic differentiation states, for 
example, hairy cell leukaemia (HCL) and CLL. 

Gene expression profiling may sub- cluster some leukaemias 
and lymphomas (Staudt, 2003). For example, DNA microar- 
rays representing 32 275 unique Unigene clusters (Stanford 
Functional Genomics Facility protocol) have been used by 
Lacayo et al (2004) on samples from childhood AML patients 
to identify FLT3 mutations with good clinical outcomes. 
However, DNA microarrays may be too complex for routine 
screening, and the expression of cellular proteins does not 
always correlate with mRNA levels. A procedure that simul- 
taneously measures the expression of many proteins on the 
surface of leukaemia cells would be an important analytical 
tool. We have developed a cluster of differentiation (CD) 
antibody microarray (DotScan 1M ) containing immobilised 
monoclonal antibody dots ( 10 nL) specific for 82 CD antigens. 
Leucocytes are captured by the corresponding immobilised 
antibody, yielding a dot pattern that defines the surface 
immunophenotype (expression profile, disease signature) of 



those cells (Belov et al, 2001, 2003). Several groups have 
proposed that extensive immunophenotyping should be suf- 
ficient to sub-classify leukaemias. Nguyen et al (2000) devel- 
oped a relational database that included the diagnostic 
immunophenotypes for 33 haematopoietic neoplasms. The 
database included the expression of 42 CD antigens and the 
correct diagnosis was listed in the top 5 differential diagnoses 
for 89% of a panel of 92 clinical cases. It was concluded that a 
pathologist must establish the final diagnosis by correlating 
histological findings with the immunophenotype. Bain (2003) 
compiled a table for AML (M0-M7) that summarised patterns 
of reactivity for categories of AML with antibodies against 
CDlib, CD13, CD14, CD15, CD33, CD34, CD 11 7, HLA-DR 
and TdT (intracellular). These proteins were differentially 
expressed between the subtypes of AML, suggesting that it 
might be possible to reclassify these leukaemias based upon 
extensive immunophenotypes. Flow cytometry has been the 
standard method for cell surface analysis and it remains a 
powerful technique particularly for leucocyte subset analysis, 
but its utility to provide an extensive immunophenotype is 
limited by cost and time. The availability of a simple, internally 
consistent and unbiased system for classification of all 
leukaemias and lymphomas would be a significant advance. 

This study analysed the immunophenotypes from the 
peripheral blood (PB) of 29 normal subjects and 527 patients 
with leukaemia or lymphoma, and from bone marrow 
aspirates (BMA) of 34 control subjects and 206 patients. 
Regularised high-dimensional classification techniques were 
used to define cell surface antigen profiles for each leukaemia 
and its associated subtypes. The data obtained with the CD 
antibody microarray correlated closely with the diagnoses 
provided by 'traditional' methods, including flow cytometric 
immunophenotyping, validating the use of this microarray for 
clinical diagnosis, and opening the possibility for further 
dissecting the biological and clinical heterogeneity of human 
leukaemias and lymphomas. 

Materials and methods 

CD antibody microarrays 

Medsaic Pty Ltd (Eveleigh, NSW, Australia) provided the 
DotScan 1M microarrays, prepared as previously described 
(Belov et al, 2001). Monoclonal antibodies were purchased 
from the following companies: Coulter and Immunotech from 
Beckman Coulter (Gladesville, NSW, Australia), Pharmingen 
from BD Biosciences (North Ryde, NSW, Australia), Biosource 
International from Applied Medical (Stafford City, Qld, 
Australia), Serotec from Australian Laboratory Services (Syd- 
ney Markets, NSW, Australia), Sigma-AJdrich (Castle Hill, 
NSW, Australia), Biotrend, Biodesign and MBL from Jomar 
Diagnostics (Stepney, SA, Australia), Chemicon Australia 
(Boronia, Vic, Australia) and Leinco Technologies (St Louis, 
MO, USA) and Calbiochem from Merck {Kilsyth, Vic, 
Australia). Antibody solutions were reconstituted as 
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c dot patterns for leukaemia cells captured on CD antibody mieroarrays. (A) Key; (8} T-ALL-63772; (C) AML-87878; (D) 
Precursor B-379; (E) B-CtL-382; (F) FL-86065, The numbers in the key refer to positions of CD antibodies; TCRa/p and TCRy/S are T-celi receptors, 
HLA-DR, FMC7, k and ?. are immunoglobulin light chains, and slg is surface immunoglobulin. Samples from patients are identified by their 
abbreviated disease name followed by the assay code number. Leucocytes did not bind to the murine isotype control antibodies, mlgGl, mlgG2a, 
mIgG2b and mlgM (not shown). Details of the fractionation of mononuclear leucocytes and immunophenotyping with CD antibody mieroarrays 
appear in Materials and methods. The intensities of the dots in these patterns were quantified as described in Materials and methods and are presented 
in Table I with corresponding fl 



recommended, and stored in aliquots with 0-1% (w/v) bovine 
serum albumin (BSA) at -80°C, Pharmingen antibodies were 
generally stored at 4°C with BSA added to 0-1% (w/v) before 
dispensing. Antibodies were used for making mieroarrays at 
concentrations ranging from 50 to 1000 tig protein/ ml. 

Immunophenotyping leucocytes from patients 

After obtaining informed consent, blood (5 ml) or bone 
marrow (1 ml) was collected from patients by venepuncture or 
needle aspiration into a tube containing heparin or EDTA. The 
project received approval from the Human Research Ethics 
Committee at Royal Prince Alfred Hospital (Camperdown, 
NSW, Australia; permit no. 99/07/07), and the Hitman 



Research Ethics Committee of St Vincents Hospital (Darling- 
hurst, NSW, Australia; reference number H02/074). Blood 
samples were obtained from Symbion Pathology (North Ryde, 
NSW, Australia), Westmead Millennium Institute (Westmead 
Hospital, Westmead, NSW, Australia), Garvan institute of 
Medical Research (Darlinghurst, NSW, Australia), University 
of Cambridge Department of Haematoiogy (Addenbrooke's 
Hospital, Cambridge, UK), MD Anderson Cancer Center, 
(Houston, TX, USA), Diagnostic Haematoiogy, Melbourne 
Health Shared Pathology Service, Royal Melbourne Hospital 
(Parkville, Vic, Australia), Kanematsu Research Laboratories 
(University of Sydney, NSW, Australia), and the Department 
of Medicine (Nepean Hospital, Penrith, NSW, Australia), Most 
blood samples were analysed within 48 h; approximately 20% 
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of samples were obtained as mononuclear leucocytes frozen in 
a viable state in dimethyl sulphoxide/fetai calf serum using 
established procedures. Control experiments have shown that 
frozen viable cells, when thawed, give a similar immunophe- 
notype to a fresh sample (Belov et al, 2003). Total leukaemic 
cell counts in the samples analysed varied from 4-0-200 x 
10 9 /1. Most of the bone marrow samples were obtained from 
the Garvan Institute of Medical Research (Darlinghurst, NSW, 
Australia); leukaemic cells accounted for at least 10% of 
marrow leucocytes. The project described in this paper was 
approved by all of the above institutions that supplied samples. 

Mononuclear leucocytes were obtained and analysed from 
blood and bone marrow samples as previously described (Belov 
et al, 2003). Briefly, a 300 pi aliquot of the leucocyte suspension 
(4 x 10 s cells) was incubated for 30 min on the microarray, 
then unbound cells were removed by gentle washing. Captured 
cells were fixed and imaged using DotReader™ and dot 



intensities were quantified for each antigen in duplicate, using 
data analysis software on an 8-bit pixel greyness scale from 0 to 
255 that reflected the level of expression of a particular antigen 
and the proportion of celts expressing that antigen (Belov et al, 
2003). The dot pattern obtained is the immunophenotype of 
that population of leucocytes; immunophenotypes for neoplas- 
tic cells were distinguishable from normal cells when the 
leukaemic count was at least 5 x 10 9 cells/1 in peripheral blood, 
or 20% of cells obtained from BMA. As the leukaemic cell count 
decreases, the binding pattern reflects the increasing proportion 
of normal leucocytes, until the clonal population is diluted 
below the limit of detection. 

Signature expression profiles 

All computations were performed using the R statistical 
language environment (R Development Core Team, 2005). 



of immuitiophenotYpic data from CD antibody mkroarrays and flow cytometry for analysis of the kukaei 



Precursor B-379 



Microarray (DotScan™) data are presented as a percentage of the maximum intensity of 100%, while flow cytometry (FC) results are presented as 
percentage of a gated population (lymphocytes arsd/or blast cells). 
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The immunophenotypes of 556 PB samples and 240 BMA 
were grouped according to a hierarchical model for classi- 
fication of leukaemias and lymphomas. Signature expression 
profiles, plotted as bar charts, were generated for each 
classification of leukaemia/lymphoma analysed, showing 
average cell binding intensities for the 82 surface antigens. 
Each bar chart was scaled to occupy the full 0-255 greyness 
scale. The data for the signature expression profiles, means 
and standard errors, have been provided as Supplementary 
Material (Tables Si and S2). 

The dot patterns (e.g. Fig 1) were used to predict the 
classification of leukaemias by using cross- validated regular- 
ised quadratic discriminant analysis. This statistical method 
of classification can be Baves optima! ("V enables & Ripley, 
2002) under certain conditions ot the data distribution (e.g. 
when the multivariate data are normally distributed). 
Regularised quadratic discriminant analysis (Friedman, 
1989) incorporates two further shrinkage parameters that 
allow for more flexibility when modelling the data distribu- 
tion. The number of parameters is kept low to avoid over- 
training. As m all statistical classification methods, the 
expression profiles ot the various samples are used to train 
the statistical classification algorithm, and thus to generate 
specific signatures tor each leukaemia category. The algorithm 
uses 'leave one out cross-validation ; all observations were 
used to train the classifier and all observations were used to 
test the classifier but no observation was used to train and 
test the classifier at the same time, ensuring that the 
predictions were not over-optimistic. Cross-validated quad- 
ratic discriminant analysis was carried out for each disease 
category and the results expressed as classification consensus, 
i.e. clinical diagnosis l defined as reported by the diagnostic 
pathology laboratory or haernatologist) versus DotScan iM 
microarray diagnosis. 

Classification score 

Graphical displays of the results of comparisons between two 
disease categories were obtained using a reduced-space gener- 
alised linear model. The generalised linear model was applied 
to a low-dimensional summary of the antibody results for each 
assay (based on principal component scores). These principal 
component scores were used to generate the linear predictor in 
a binomial error model with logit link function (Venables & 
Ripley, 2002). For new observations (i.e. observations not used 
in fitting the generalised linear model), predicted values were 



generated on the link scale (McCullagh & Nelder, 1999), 
translated such that the prediction boundary between groups 
was at zero. Throughout this paper, these predicted values on 
the logit scale are referred to as classification scores. A 
classification score greater than zero implies predicted 
membership of one disease group; whilst a classification score 
less than zero implies predicted membership of the other 
disease group. The calculation of classification scores is 
summarised in Appendix SI. 

Discriminant functions analysis 

The immunophenotypic data are highly multi-dimensional, 
and to assist visualisation were proiected onto an appropriate 
three-dimensional space that provided the best separation, 
there are several wavs m which this projection can be made. 
The hrst is principal component analysis that protects the data 
along the directions showing maximum variance. Another 
option is linear discriminant analysis, where directions are 
chosen to maximise between-group separation. hot high 
dimensional data, it is preferable to perform principal 
component analysis first utilising a large number ot compo- 
nents (n = 15 components were chosen for the immuno- 
phenotvpic data), followed bv a discriminant analysis that 
improves separation, lhe immunophenotypic data were 
visualised by protection onto the space ot the hrst three 
discriminant functions. Alter the three-dimensional projec- 
tions were chosen, thev were rotated and inspected manually, 
and the two-dimensional snapshots that presented the best 
separations were selected. In the case of those immunopheno- 
tvpes that clustered, we concluded that the samples belonged 
to the same disease category and thev were compared with 
their clinical diagnoses. 

Signature tightness indicators 

The disease classification algorithm relies on spatial informa- 
tion on the location and shape of the disease cluster that is 
provided by the centroid of the cluster and the variance- 
covariance matrix. The mean Euclidean Distance to the 
centroid (E) was calculated from the log-normalised cell 
binding intensity data, for each disease category. E is the 
average distance from each assay to the centroid of the 
respective disease cluster, a simple measure of the size of 
the cluster. A targe value for E corresponds to a more spread- 



Fig 2. Expression profiles for acute leukaemias. (A) peripheral blood; (B) bone marrow aspirates. Leukaemias are denoted by acronyms defined in the 
text; numbers of patient samples tested are in brackets. The bar charts have been scaled to give a maximum intensity value of 255 on the 8-bit greyness 
scale as described in Materials and methods. The staggered numbers on the abscissa refer to CD antigens; their abbreviations are defined in the legend 
to Fig I . Signature tightness expressed as values for the mean Euclidean Distance to the centroid (E) for the cluster of dot patterns for each clinical 
subtype were for (A) PB: Normal, 7-00; AML, 978; Precursor-B-ALL, 9-03; T-ALL, 9-93. (B) BMA: Control, 6-40; AML, 972; Precursor-E-ALL, 8-48; 
T-ALL, 8 53. 
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Results 

Dot patterns for leukaemias 

Mononuclear leucocytes from leukaemia patients, diagnosed 
using established criteria, were immunophenotyped using the 
CD antibody microarray. Examples of dot patterns are shown 
in Fig 1: T-ALL (Fig IB), AML (Fig 1C), Precursor B-ALL 
(Precursor B, Fig ID), B-CLL (Fig IE), and follicular lym- 
phoma (Fig IF), Each of these B-cell neoplasms strongly 
expressed the B-cell antigens CD19 and CD20, but expression 
of other B-cell antigens (CD21, CD22, CD23, CD79b and 
immunoglobulin light chains) was variable. Table I compares 
DotScan™ values (expressed as a percentage of the most 
intense dot on the microarray) with flow cytometric data for 
the same patient obtained from a pathology laboratory. The 
data available from flow cytometry were limited by the use of 
time and reagents, but the data tabulated for particular CD 
antigens showed very good correspondence between the level 
of positivity from flow cytometry and the corresponding 
intensity percentage from the antibody microarray. 

Expression profiles for acute leukaemias and 
B-lymphoproliferative disorders 

A comparison of the average expression profiles for Hist- 
opaque-prepared PB- Normal (n = 29) and BMA-Control 
(n = 34) samples (Fig 2A and B) showed that the PB-Normal 
population expressed antigens predominantly associated with 
T cells (TCRa/p, CD2, CD3, CD4, CD5, CD7 and CD28), with 
low levels of B cell (CD19-CD23, kappa, lambda), platelet 
(CD41, CD42a, CD61) and myeloid (CD13, CD14 and CD33) 
antigens, while the BMA-control samples showed mainly 
immature granulocyte antigens (CD13, CD 1 5, CD16, 
C.D24, CD43, CD45RO and CD66c), with relatively low levels 
of T- and B-cell antigens. Expression of CD 11 a, CD31, CD43, 
CD44 and CD45 was high (cell binding density > 100) in both 
populations; CD9, CD29, CD36, CD45RA, CD49d, CD49e and 
CD52 were higher in PB, while CD lib and CD45RO were 
higher in BMA. These results reflect the normal mix of cell 
lineages seen in blood and marrow samples. 

The average expression profile for AML samples from the PB 
(n = 99) was similar to that of BMA {n = 112, Fig 2A and B), 
with high expression of CD 11a, CD29, CD31, CD38, CD43, 
CD44, CD45, CD45RA, CD49d, CD49e and CD71, and lower 
expression (cell binding density 20-100) of CD2, CD4, CD9, 
CDllb, CDlic, CD13, CD15, CD33, CD36, CD45RO, CD52, 
CD64 and HLA-DR. The average expression of GDI 17 was 
higher in AML samples taken from BMA than PB. Profiles for 



Precursor B-ALL samples from the PB (n = 37) and BMA 
(?j = 65) were also comparable (Fig 2 A and B), with high 
expression of CD9, CD10, CD19, CD22, CD29, CD31, CD38, 
CD43, CD44, CD45RA, CD49d, CD49e and HLA-DR, and 
Sower expression of CDlla, CD24, CD45, CD52, CD66c and 
CD71. The average expression of CD34 was higher in BMA 
than PB. Profiles for T-ALL samples from the PB (n = 8) and 
BMA (n - 9) were comparable (Fig 2A and B), showing high 
expression of CDS, CD7, CDlla, CD29, CD31, CD3S, CD43, 
CD44, CD45, CD49d, CD49e and CD71 and lower expression 
of CD4 and CD95. 

Fig 3 shows expression profiles for AML cases with assigned 
M0-M7 subtypes from PB (n = 31; Fig 3 A) and BMA 
(n = 43; Fig 3B). The profiles for PB and BMA were similar 
for a given subtype of AML. Alignment of profiles for different 
subtypes from PB or BMA showed differential expression of 
CD antigens. The expression profile for M3 AML corresponds 
well with reported immunophenotypes, showing little or no 
expression of CDllb, CD 11c, CD 15, CD34, CD45RO, CD86 
and HLA-DR, and overexpression of CD9 (Miyachi et al, 1999; 
Re el al, 2002; Paietta et al, 2004). AML M4 and M5 expressed 
CD4, CDllb, CD11C.CD15, CD36, CD64 andCD86 (Brouwer 
et al, 2000; Legrand et al, 2000; Dunphy et al, 2004). CD4, 
CD15 and CD36 were expressed by AML M6. AML M7 
expressed the megakaryocytic antigens CD9, CD41 and CD61 
(Paredes-Aguilera et al, 2003; Yatomi et al, 2004), HLA-DR 
and CD36 (Khalidi et al, 1998). CD13, CD33 and CD117 
contributed to the profiles of all subtypes of AML (M0-M7, 
Legrand et al, 2000). HLA-DR contributed to the profiles of all 
subtypes of AML except for M3 (Legrand et al, 2000). 

Peripheral blood samples from a range of B-LPD (Fig 4A) 
including B-CLL (n = 313), follicular lymphoma (FL; n = 3), 
HCL (n = 8), MCL (n = 10) and non-Hodgkin lymphoma 
(NHL) of unclassified subtype (n = 23), showed strong 
expression of CD19, CD22, CD29, CD31, CD37, CD40, 
CD44, CD45, CD45RA, CD52 and HLA-DR, while CD9, 
CD54 and FMC7 were relatively low. Several antigens were 
differentially expressed: CDS (B-CLL and MCL), CD23, CD43 
(high in B-CLL), CD10 and CD38 (high in FL), CDllb 
(MCL), CDUc (HCL), CD49d (low in B-CLL), CD79b (high 
in FL), CD 103 (HCL) and slg (high in FL, MCL, NHL). 

The NHL category includes different subtypes with periph- 
eral blood or bone marrow involvement. Many of these 
individuals did not have contemporaneous nodal histology 
available and therefore the 'signature' for NHL does not define 
a single disease entity, but is a composite pattern for a range of 
B-LPD excluding CLL, HCL and proiymphocytic leukaemia 
(PLL). The expression profiles of B-CLL (n = 11), FL (n = 1) 
and NHL (n = 8) from BMA (Fig 4B) were similar to those of 



Fig 3. Expression profiles of subtypes of AML. (A) peripheral blood; (B) bone marrow aspirates. The bar charts were compiled and scaled as for Fig 2 
using samples classified by the FAB system for subtypes of AML (M0-M73; numbers of samples are in brackets. Signature tightness expressed as values 
for the mean Eudidean Distance to the centroid (E) for the duster of dot patterns for each clinical subtype were for (A) PB; AML M0, no value; AML 
Ml, 6 43; AML M2, 12-7; AML M3, 6-27; AML M4, 8-04; AML M5, 8-19; AML M6, 7-30. (A) BMA: AML M0, 7-57; AML Ml, 644; AML M2, 7-67; 
AMI. M3, 6-14; AML M4, 10-6; AML M5, 10-2; AML M6, 4 50; AML M7, 4-92. 

© 2006 The Authors 

Journal Compilation © 2006 Blackweil Publishing Ltd, British Journal of Haematology, 135, 184-197 191 



L. Belov et a! 




PB-NORMAL ( 29 ) 



,JlilliJui Xji 

il.M..JllL 



Antigen 



jiLiJk_ 



BMA-CONTROL ( 34 ) 




Fig 4. Expression profiles for B-lymphoproliferative disorders. (A) Peripheral blood; (B) bone marrow aspirates. The bar charts were compiled as for 
Fig 2. Signature tightness expressed as values for the mean Euclidean Distance to the centroid (E) for the cluster of dot patterns for each clinical subtype 
were for (A) PB: Normal, 7-00; B-CLL, 874; FL, 6 61; HCL, 7-99; MCL, 9-04; NHL, 10-3. (B) BMA: Control, 6-40; B-CLL, 8-23; FL, no value; NHL, 7-68. 
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PB. The striking depth and complexity of the data may 
potentially enable farther sub-classification of leukaemias and 
lymphomas using cell surface expression profiles alone. 



o determine the Threshold of 



Use of classification 
detection for CLL 

A model system was developed to determine the threshold 
proportion of CLL cells from PB that provide a distinctive dot 
pattern (disease signature) enabling the correct diagnosis to be 
made. A series of dilutions of CLL cells was prepared in a 
suspension of normal mononuclear leucocytes with the same 
total cell density (L3 x I0 7 cells/ml). A classification score 
based on a reduced space linear model was used for CLL versus 
normal samples derived from the entire database. This 
classifier was then used to predict whether each dilution was 
CLL or norma] peripheral blood leucocytes. The classification 
was derived as described above (from the logit scale-predicted 
values obtained with the generalised linear model). More 
extreme values of the score statistic provided more evidence in 
favour of one of the categories (CLL/normaS). The value of the 
classification score was plotted against the known mixing 
are shown in Fig 5 for CLL cells from 
are linear and showed a threshold for 
detection of the CLL dot pattern that varied from 0-08 to 0-23 
CLL ceils of the suspensions. Thus, a threshold of 20% 



proportion, Four plot 
four patients, The plot; 



leukaemic cells was a minimal threshold for detection of a CLL 
dot pattern against a background of non 
leucocytes. A conservative threshold wo 1 
leukaemia cells against a normal background of 3-6 x I0 9 /1 
total normal leucocytes. 

Correlation between clinical diagnoses and statistical 
clustering 

Plots of discriminant function analysis for 330 acute leukaemias 
(n = 144 from PB; n = 186 from BMA; Fig 6) showed distinct 
dusters of expression profiles that corresponded with the 
clinical diagnoses for AML (n = 99 from PB; n = 112 from 
BMA), Precursor B-ALL {n = 37 from PB; n = 65 from BMA) 
and T-ALL (n = 8 from PB; n = 9 from BMA). These 
leukaemia clusters were distinct from each other and normal 
PB leucocytes (PB-Normal; n = 29) and control bone marrow 
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Fig 7. Discriminant function analysis plot for B-lymphoproiiferative 
disorders. (A) Peripheral blood; (BJ bone marrow aspirates. Symbols 
denoting each leukaemia and the numbers of samples tested are shown 

aspirates (BMA-Control; n = 34). Thus, the microarray was 
able to distinguish between these types of acute leukaemia. A 
similar analysis of 357 B-LPD from PB showed distinct clusters 
of 313 B-CLL and 29 PB-Normal, with the less numerous 
B-LPD in overlapping groups {Fig 7A). These subtypes of 
B-LPD: PL (ft = 3), HCL (n = 8), MCL [n = 10) and NHL 
(n — 23), clustered in the three-dimensional space of the 
analysis, with some overlap of the mixed NHL group with MCL 
and FL (Fig 4). More patient data would define these profiles 
with greater precision. Discriminant function analysis of BMA 
samples of B-CLL (n = 11), NHL (n = 8), FL (n = 1) and 
BMA-Controls (n — 34) also showed distinct clusters (Pig 7B). 

Determination of classification consensus 

A total of 796 samples (556 PB and 240 BMA) were analysed 
using the quadratic regularised discriminant analysis for 
classification consensus, based solely upon an extensive 
immunophenotype and compared with the clinical diagnosis 
provided by individual institutions (Materials and methods). 
The consensus rate for PB was high (96 6-100%) for most of 
the disease categories (PB-Normal, A ML, B-CLL, HCL and 
Precursor B-ALL, data not shown), That is, the microarray 
classification agreed with the diagnosis by pathology laborat- 



ories using conventional criteria. T-ceil leukaemias were 
distinguished from normal samples and other disease categ- 
ories in 88-2% of PB samples. However, 50% of T-ALL 
clustered with mature T-LPD, while 15-4% of mature T-LPD 
clustered with T-ALL, suggesting that intracellular TdT 
detection may be required to conclusively distinguish between 
these two categories, or alternatively, discriminatory disease 
signatures may be defined from extensive immunophenotvpes 
when larger numbers of samples are analysed in each category. 
A few leukaemia samples clustered with PB-Normal or BMA- 
Control because their immunophenotypes were masked by a 
predominantly normal population of cells. Detection of 
abnormal cells became progressively more difficult when the 
leukaemic population was below 5 x 10 9 cells/1 in PB or <20% 
of cells in BMA. In other cases, discordance could be attributed 
to lineage infidelity, sample quality or to small numbers of 
patients rather than to the performance of the assay. 

Discriminatory antibodies and signature tightness 
Antibodies in the microarray were ranked for discrimination 
between diseases. The overall ranking of the antibodies on the 
microarray was generated by means of a single factor analysis 
of variance (anova) of the log-normalised cell binding 
intensity data for each antibody. The effectiveness of an 
antibody in separating all the respective disease categories 
included in an analysts was evaluated by the standard 
F-statistic, defined as the ratio of the mean square between 
groups (disease categories) and the mean square within 
groups. The associated P-value measuring the level of signi- 
ficance was also computed for each particular antibody. For 
PB, all 82 antibodies contributed significantly to the discrim- 
ination of diseases (P < 0*0005). For BMA, 60 of 82 antibodies 
were significantly discriminatory (P < 0-05). The high propor- 
tion of CD antibodies that discriminated between the subtypes 
of leukaemias and lymphomas was not surprising, since they 
were selected from many publications describing expression of 
CD antigens on leukaemias. 

Signature tightness expressed as values for the mean Euclid- 
ean Distance to the centroid (E) for the cluster of dot patterns 
for each clinical subtype (listed in legends to Figs 2-4) were 
higher for clusters of dot patterns that showed significant 
variation. Values of >9-0 were obtained for PB samples of AML 
(9-78), Precursor-B-ALL (9'03), T-ALL (9-93), MCL (9'04), 
NHL (10-3), and AML M2 (12-7), compared with normals 
(7-00). The higher values indicated heterogeneity in the dot 
patterns for these diseases and either the need for more samples 
to define the signature (expression profile), or the potential for 
sub- classification of the clinical disease entity, The latter 
alternative is certainly true for AML, T-ALL and NHL. 

Discussion 

Leukaemias and lymphomas are currently diagnosed using 
criteria that encompass morphology, immunophenotypic 
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analysis, cytochemistry and cytogenetics. Development of 
diagnostic criteria based solely on extensive immunopheno- 
types has been limited by the multiple assays required using 
flow cytometry, The CD antibody microarray enables popu- 
lations of mononuclear leucocytes to be screened using a large 
panel of antibodies in one assay. Results presented here show 
that the microarray distinguished the major categories of 
haematoiogical neoplasms with high accuracy when those cells 
were above a threshold relative to normal cells (5 x 10 9 cells/i 
for PB, and 20% of cells in BMA). Comparison of the 
expression profiles of CD antigens of unknown samples with 
stored consensus signatures defines subtypes of acute leukae- 
mias (AML, B-ALL and T-ALL; Fig 2), and B-LPD (B-CLL, 
HCL and NHL including FL and MCL; Fig 4). 

There are many immunophenotypic differences between the 
acute leukaemias (Fig 2). The lineage discriminatory AML 
antigens are CDllb, CD 11c, CD13, CDS 5, CD33, CD36, CD64 
and CD 117; the Precursor B antigens are CD10, CD19, CD22, 
CD24, CD66c and HLA-DR; the T-ALL antigens are CD2, 
CD5, CD7 and CD28. Immunophenotypic differences within a 
classification may aid in defining subtypes from a larger 
database. AML is especially diverse, and extensive immuno- 
phenotypes obtained with the CD antibody microarray may 
provide the basis for reclassification of subtypes and provide 
correlations for prognosis and treatment. Although the 
expression profiles shown in Fig 3 indicate differential expres- 
sion of a number of antigens in the AML subtypes M0-M7, 
antigen expression may vary within each subtype. 

The diagnosis of CLL is currently based on a lymphocytosis 
of >5 x lO'/l, and a characteristic lymphocyte morphology and 
immunophenotype (Oscier et al, 2004). CLL typically expres- 
ses CDS, CD 19, CD23 and weak rnonotypic surface immuno- 
globulin (slg), CD20 of variable intensity, while CD22, CD79b 
and FMC7 are often weakly expressed or absent (Jaffe et al, 
2001; Bain, 2003; Oscier et al, 2004; Fig 4). This immunophe- 
notype is consistent with data from the microarray. Differences 
in the expression profiles of B-LPD (see Fig 4) suggest that the 
CD antibody microarray can distinguish HCL and NHL 
(including FL and MCL) from B-CLL, although more data are 
required to define consensus immunophenotypes. Stable and 
progressive CLL may be identified by a variety of prognostic 
factors, such as clinical stage, mutational status of the 
immunoglobulin variable heavy chain gene (IgV H ), cytoplas- 
mic Zap-70 and surface CD38 expression. We are currently 
correlating microarray phenotypes within B-CLL patient 
groups that have full prognostic factor analysis and clinical 
outcome data. The microarray has recently been extended with 
a 'satellite array' containing antibodies against cytokine, 
adhesion and apoptotk receptors that may be differentially 
expressed on subtypes of B-CLL. 

In conclusion, there is close correlation between data 
obtained from the CD antibody microarray and traditional 
diagnostic methods across a wide range of leukaemias 
(Table I), but more data are needed for uncommon cases of 
leukaemia to better define expression profiles. The clustering of 



dot patterns (expression profiles, immunophenotypes, disease 
signatures) for acute leukaemias (Fig 6) and B-LPD (Fig 7) 
from PB and BMA validated the use of CD antibody 
microarrays for diagnosis. For the data from 796 subjects 
presented here, the overall levels of consensus were 93-9% 
(495/527) for PB and 97-6% (201/206) for BMA. The 
microarray provides a simple, unbiased procedure for 
immunophenotyping with far more information than flow 
cytometry without subjective gating. The extensive immuno- 
phenotypes obtained also have the potential to identify 
unusual CD antigens as targets for therapeutic antibodies. 
For flow cytometry, intracellular antigens, such as TdT, are 
also screened as markers to distinguish subtypes of leukaemia. 
With the extensive immunophenotype available from the 
microarray, intracellular antigens have not been required so far 
to distinguish subtypes except, perhaps for T-ALL from T- 
LPD. However, procedures have been developed to test cells 
captured on the microarray for expression of the intracellular 
antigens abl-bcr and TdT that wiil be the subject of a later 
report (unpublished observations). 

The microarray has been used to quantify immunopheno- 
typic changes induced by drugs on leukaemia and lymphoma 
cell lines (e.g. White et al, 2005) with the data obtained on 
different microarrays normalised on a maximum dot intensity 
of 255 (cf. Figs 3-5), or by using an interna! standard such as 
CD44. The microarray is also being used for analysis of other 
diseases that affect sub-popuiations of leucocytes, for example, 
infection with human immunodeficiency virus (Woolfson 
et al, 2005). An extensive immunophenotype may enable 
systematic and unbiased sub- classification of AML and B-LPD 
based upon a single assay, and has the potential to detect new 
disease entities and prognostic groups. 
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